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Abstract 

It is common to characterise networks based on their statistical properties. It has been shown 
that social networks, such as networks of co-starring film actors, collaborating scientists and email 
communicators, exhibit a positive first-order assortative behaviour, i.e. if two nodes are connected, then 
their degrees (the number of links a node has) are similar In contrast, technological and biological 
networks, such as the Internet and protein interactions, exhibit a disassortative behaviour, i.e. high-degree 
nodes tend to link with low-degree ones. For social networks, a node's degree has been assumed to be a 
proxy for its importance/prominence within the network, and the assortative behaviour is then interpreted 
as indicating that people mix with people of comparable prominence. In this paper we introduce a new 
property, second-order assortative mixing, which measures the correlation between the most prominent 
neighbours of two connected nodes, rather than the prominence of the nodes themselves. Five social 
networks and six other networks are examined. We observe very stronge second-order assortative mixing 
in social networks. This suggests that if two people interact in a social network then the importance of 
the most prominent person each knows is very likely to be the same. This is also true if we measure 
the average prominence of neighbours of the two people. This property is weaker or negative in non- 
social networks. We investigate a number of possible explanations for this property, including statistical 
significance, neighbourhood size, power-law degree distribution, cluster coefficient, triangles and bipartite 
graphs. However, none of these properties was found to provide an adequate explanation. We therefore 
conclude that second-order assortative mixing is a fundamental property of social networks. 
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Fig. 1. Illustration of a part of a network depicting actors (nodes) and the films (links) they have acted together in. 



A network or graph consists of nodes (also called vertices) connected together via links. Networks are 
utilised in many disciplines. The nodes model physical elements such as people, proteins or cities, and 
the links between nodes represent connections between them, such as contacts, biochemical interactions, 
and roads. For example, a portion of a network describing actors and the films they have acted together in 
is depicted in Fig. [J In recent years studying the structure, function and evolution of networked systems 
in society and nature has become a major research focus H], 0, 0, H, 0, f6l. 

The connectivity or degree, k, of a node is defined as the number of links the node possesses. The 
probability distribution of node degrees is indicative of a network's global connectivity. For example 
random graphs with a Poisson degree distribution [Tl have most nodes with degrees close to the average 
degree. In contrast, many complex networks in nature and society are scale-free graphs ||8l exhibiting a 
power-law degree distribution, where many nodes have only a few links and a small number of nodes have 
a very large number of links. However, the degree distribution alone does not provide a full description 
of a network's topology. Networks with identical degree distributions can possess other properties that 
are vastly different |[9l. ifTOl. lITTTl. 

One such property, is the mixing pattern between the two end nodes of a link ||T2| , fT3l|, i.e. the joint 
probability distribution of a node with degree k being connected to a node with degree k'. In general, 
biological and technological networks are disassortative meaning that well-connected nodes tend to link 



I. Background 
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Fig. 2. An example network. The assortative coefficients are calculated by the excess degree which is degree minus one [13]. 
Consider the link between nodes A and B, the (excess) degree of node A is 5. The neighbours maximum (excess) degree of 
node A is 7, which is the (excess) degree of node C. 

with poorly-connected nodes, and vice versa. In contrast, social networks, such as collaborations between 
film actors or scientists, exhibit assortative mixing, where nodes with similar degrees are connected. 

To quantify this mixing property, Newman |[T3l proposed the assortative coefficient, r, where — 1 ^ 
r ^ 1. It is derived by considering the Pearson correlation between two sequences, where corresponding 
elements in the two sequences represent the degree of the nodes at either end of a link in the network. . 
When r ^ 1 there is a perfect assortative mixing in the network, i.e., every link connects two nodes with 
the same degree. When r ~ there is no correlation. In terms of degree mixing, the network is random or 
neutral. If the network is perfectly disassortative, i.e., every link connects two nodes of different degrees, 
r is negative and has a value lying in the range — 1 ^ r < 0. The mixing pattern has been regarded as a 
fundamental property of networks and the assortative coefficient r has been widely used to measure this 
property. 

II. Second-Order Assortative Mixing 

We now introduce and define a related property which we refer to as the second-order mixing. Consider 
a link i connecting two nodes, a and b, with degrees ka and k},. Let N^^^i^ denote the set of node a's 
neighbours, excluding h, i.e. all the other nodes that a is directly connected to. Each of these neighbours, 
Ua, has a corresponding degree, kn^- Following Newman's definition of the (first-order) assortative 

'For a directed network, the degree of the starting node of a link is contained in one sequence, and the degree of the ending 
node is in the other sequence. The number of elements in each sequence is the number of links. For an undirected network, 
as all the networks studied in this paper, each undirected link is replaced by two directed links pointing at opposite directions. 
Thus the number of elements in a sequence is twice the number of links. 
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coefficient r which is based on the degrees of the two end nodes of a link, we define the second-order 
assortative coefficient TZmax as 



7? - i 



\l-'Y.{k, + k'^ 



2 ' 



(1) 



2 

where L is the number of links, Ki and K[ are the maximum degree of neighbours of the two nodes 
connected by link i, i.e. Ki = max(/i:„^ : Ua G -^a\b) and K[ = max(A;„^ : rn, G Ny^^^). Similarly 
we define the second-order assortative coefficient TZavg based on the neighbours' average degrees by 
replacing Ki and K[ in the above equation as = ^ E„„eAf„\6 = En^GAf^a 

that when calculating the first and second assortative coefficients, we actually use the excess degree [13] 
which is degree minus oneQ See Fig. |2] for examples. 

Values of the assortative coefficients, r, TZavg and TZmax are provided in TablelH We consider eleven 
networks, including five social networks, two biological networks, two technology networks, and two 
synthetic networks based on random connections Q and the Barabasi and Albert ||8l model, respectively. 

The expected standard deviation a on the value of assortative coefficient r can be obtained by the 
jackknife method [22 1 as cr^ = '}2d=ii'''i ~ ^)^' where is the value of r for the network in which the 
i-th link is removed and i = 1,2, ...L. And likewise for second-order assortative coefficients TZmax and 
TZavg- For all cases shown in Table 1, the value of a is very small (< 0.03), which validates the statistical 
significance of the coefficients. 

The comparison between the first and second-order assortative coefficients is remarkable. For the five 
social networks, while both the first and second order coefficients are positive, the value of the second- 
order assortative coefficients, in terms of both neighbours average degree and neighbours maximum 
degree, are very much higher. The strongest effect of second-order mixing is observed for the Film 
actor. Scientist and Secure email networks. The weakest (though still strong) effect is exhib- 
ited by the Email network. The fact that the Secure email network, using the Pretty Good Privacy 
(PGP) protocol ifTSl . exhibits a much stronger effect is probably due to the nature of this network which 
explicitly relies on endorsement between members of the network for the purpose of authentication 
and encryption. Interestingly, while the Email and Musician networks exhibit very low first-order 
assortative mixing, they exhibit strong second-order assortative mixing. 

^ This is because, when considering two connected nodes, A and B, the neighborhood of A is defined to exclude B. And 
hkewise for the neighborhood of B. 
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TABLE I 

Properties of the networks under study. Social networks: (a) Film actor collaborations |[8|, where two 

ACTORS ARE CONNECTED IF THEY HAVE CO-STARRED IN A FILM; (B) SCIENTIST COLLABORATIONS ifTTj . WHERE TWO 
SCIENTISTS ARE CONNECTED IF THEY HAVE COAUTHORED A PAPER IN CONDENSE MATTER PHYSICS; (C) SECURE EMAIL 

NETWORK 1 151, WHERE TWO USERS ARE CONNECTED IF THEY HAVE USED THE PRETTY GOOD PRIVACY (PGP) 
ALGORITHM FOR SECURE EMAIL EXCHANGE; (D) EMAIL EXCHANGES BETWEEN MEMBERS OF A UNIVERSITY lfT6l ; AND 

(E) Jazz musician network IfTTl , where two musicians are connected if they have played in a band. 
Biological networks: (f) C. elegans metabolic network ifTsll . where two metabolites are connected if they 
participate in a biochemical reaction; and (g) the protein interactions of the yeast Saccharomyces 
cerevisiae |fT9l . Il20l . Technological networks: (h) Western States Power Grid of the United States flT\ \ 
AND (I) Internet Hi) ( http: //www .r outeviews.0RG/j . where two service providers are connected if they 
have a commercial agreement to exchange data traffic. Synthetic networks: (j) the random graphs Q; 

AND (K) THE BARABASI-ALBERT (BA) GRAPHS ID. PROPERTIES SHOWN ARE THE NUMBERS OF NODES AND LINKS; THE 
ASSORTATIVE COEFFICIENTS r, TZavg AND TZmax, BASED ON DEGREE, NEIGHBOURS AVERAGE DEGREE AND NEIGHBOURS 
MAXIMUM DEGREE, RESPECTIVELY, EACH COMPARING WITH ITS EXPECTED STANDARD DEVIATION a; AND THE AVERAGE 

CLUSTERING COEFFICIENT (C). 



Network 


Nodes 


Links 




r 


Or 








^max 


(C> 


(a) Film actor 


82,593 


3,666,738 


0. 


,206 


0.013 


0.836 


0.027 


0.813 


0.009 


0.75 


(b) Scientist 


12,722 


39,967 


0. 


161 


0.007 


0.680 


0.014 


0.647 


0.005 


0.65 


(c) Secure email 


10,680 


24,316 


0. 


,238 


0.007 


0.653 


0.009 


0.680 


0.007 


0.27 


(d) Email 


1,133 


5,451 


0. 


,078 


0.014 


0.242 


0.014 


0.247 


0.014 


0.22 


(e) Musician 


198 


2,742 


0. 


,020 


0.019 


0.543 


0.023 


0.307 


0.029 


0.62 


(f) Metabolism 


453 


2,025 


-0. 


,226 


0.011 


0.265 


0.032 


0.263 


0.023 


0.65 


(g) Protein 


4,626 


14,801 


-0. 


137 


0.008 


-0.046 


0.007 


0.033 


0.009 


0.09 


(h) Power grid 


4,941 


6,594 


0. 


,004 


0.014 


0.205 


0.015 


0.258 


0.016 


0.08 


(i) Internet 


11,174 


23,409 


-0. 


195 


0.001 


-0.097 


0.004 


0.036 


0.008 


0.30 


(j) Random graph 


10,000 


30,000 







0.009 





0.011 





0.006 





(k) BA graph 


10,000 


30,000 







0.004 





0.008 





0.008 






Other forms of networks do not exhibit the very strong second-order correlations exhibited by social 
networks. However, the Metabolism and Power grid networks do exhibit second order assortative 
mixing comparable to the Email network, the weakest of the social networks. For the Internet 
and Protein networks the second order assortative mixing is almost zero {Rmax, or negative (Ravg)- 
As expected, synthetic networks generated by generic graph models are completely uncorrelated, i.e. 

Rmax — Ravg — 0. 

Fig-ElA) shows the first-order assortative mixing of the Scientist network by plotting the frequency 
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Fig. 3. The first and second-order assortative mixing in the Scientist network. We show the frequency distribution of 
links as functions of (A) degrees k and k' of the two end nodes of a link, with k ^ k'; (B) the neighbours' maximum degrees 
of the two nodes, Kmax and K'^ax, with Kmax ^ K'^ax\ and (C) the neighbours' average degrees, Kavg and K'ayg, with 
Kavg S5 K'^yg, respectively. (D) is (B) for the randomised case of the network preserving the degree distribution. The maximum 
degree of the network is 97. 



distribution of the number of links as a function of degrees, k and k', of the two ends of a link. While 
we observe a strong correlation between degrees less than 20, there is no correlation beyond this. The 
frequency distribution rapidly decreases with increasing degree, as expected for a scale-free network. 
In contrast, Fig. [3^6) and (C) show the second-order assortative mixing by plotting the distribution of 
links as functions of the neighbours maximum degrees, Kmax and K'^^^, and the neighbours average 
degrees, Kavg and K'^^g, respectively. In Fig. OB), we observe a very strong correlation for almost all 
Kmax values. Moreover, the link distribution along the diagonal does not decrease with the increase of 
Kmax, but is approximately flat over two orders of magnitude. Of course the correlation in Fig. [UB) is 
not perfect, and a second process appears to be uniform noise. If the average rather than the maximum 
is considered, Fig. OC), we still observe a strong correlation. The noise might be better modelled as 
Gaussian which is probably due to the summation of many nodes and the central limit theorem. 
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TABLE 11 

Null hypothesis test. The average assortative coefficients, (r'), {TZ'avg) and (TZ'max), and their standard 

DEVIATION, 5, OBTAINED AFTER RANDOM PERMUTATION OF ONE OF THE TWO VALUE SEQUENCES. 



Network 


(r') 








{T^rnax ) 




(a) Film actor 


0.00001 


0.00034 


0.00000 


0.00034 


0.00001 


0.00036 


(b) Scientist 


-0.00029 


0.00360 


0.00010 


0.00334 


0.00035 


0.00343 


(c) Secure email 


-0.00065 


0.00422 


0.00043 


0.00470 


-0.00020 


0.00498 


(d) Email 


0.00021 


0.01078 


0.00177 


0.00861 


0.00101 


0.00970 


(e) Musician 


0.00050 


0.01250 


-0.00168 


0.01380 


-0.00026 


0.01300 


(f) Metabolism 


-0.00021 


0.01469 


-0.00106 


0.01518 


0.00134 


0.01460 


(g) Protein 


0.00000 


0.00536 


0.00058 


0.00513 


-0.00072 


0.00520 


(h) Power grid 


-0.00171 


0.00842 


0.00152 


0.00886 


-0.00039 


0.00811 


(i) Internet 


-0.00033 


0.00395 


0.00072 


0.00459 


0.00001 


0.00422 



III. Significance 

In this section we first test tlie statistic significance of the second-order mixing property. Then we 
address the question of whether the second-order mixing is a new topological property, i.e. whether it 
can be explained by other known properties of the networks. 

A. Null Hypothesis Test 

A high correlation score between two value sequences must be tested against the null hypothesis. 
For each network and each coefficient, we randomly permuted the order of values in one of the two 
sequences and re-computed the coefficient. This was repeated 100 times and then we calculated the 
mean and standard deviation of the coefficient. Results are shown in Table |lll Typically the mean value 
is close to zero and the standard deviation is small. These results confirmed the statistical significance 
of the first and second-order assortative coefficients, reported in Table Jl 

B. Increasing Neighbourhood Size 

The strong correlation scores associated with second-order assortative mixing could simply be due to 
the increased extent of neighbourhood (from distance of one hop to two hops), i.e. a node always has 
more second-order neighbours than first-order neighbours. To exclude this possibility we also examined 
the Xth-order assortative coefficients, TZmax and TZavg, which are calculated using the maximum or 
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average degree within the neighbourhood of no more than X hops from each end of a link. We observed 
that for all networks under study, the values of the 3rd-order coefficients were smaller than the 2nd-order 
coefficients. This shows the second-order assortative mixing is not caused by increased neighbourhood 
(alone). Of course, if the neighbourhood continues to increase, eventually the coefficients will increase 
and tend to one. This is to be expected since eventually, the neighbourhood encompasses the entire 
network. 

C. High-Degree Nodes 

Another possible explanation for the high values of second-order assortative coefficients considered, is 
that there are a few hub nodes that are extremely well connected and dominate the network structure. To 
test this we removed the best-connected node (together with the links attaching to it and any resulting 
isolated nodes) from the networks and re-computed the coefficients. We also calculate the coefficients 
after removing the top 5 best-connected nodes. Results are shown in Table JII] In all cases, the coefficients 
change very little. For some networks, such as the Secure email. Musician and Metabolism 
networks, the second-order coefficients became stronger after the best-connected nodes are removed. 

D. Power-Law Degree Distribution 

While high degree nodes do not explain the high second order assortative mixing scores, the underlying 
heterogenous power-law structure of the networks was also considered as an explanation. To exclude this 
possibility we used the random link rewiring algorithm ||9l, ifTTI to produce surrogate networks that 
preserve the degree distribution of the network under study. The process is as follows. Firstly we choose 
a pair of links at random. The links should have four different end nodes. Secondly we randomly rewire 
the two links between the four end nodes. We only accept rewired links if no duplicate link is generated 
and the network remains as a single component; otherwise we keep the original links. Then we repeat 
the above two steps a sufficiently large number of times, usually a thousand times of the number of 
links in the network. This process does not change each node's degree and therefore preserves exactly 
the network's original degree distribution. The resulting network can be regarded as a randomised case 
of the original network. 

Fig. IHD) illustrates the distribution of links as a function of K^ax and K'^^^ for a randomised case 
of the Scientist network. The second-order assortative mixing in the original network disappears 
completely in the randomised case, where the second-order mixing appears to be modelled as uniform 
noise. This result shows that the second-order mixing is not constrained by a network's degree distribution. 
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TABLE III 

ASSORTATIVE COEFHCIENTS OF THE NETWORKS AFTER REMOVING (1) THE BEST-CONNECTED NODE OR (2) THE TOP 5 
BEST-CONNECTED NODES. ALSO SHOWN ARE THE NUMBERS OF NODES, LINKS AND THE MAXIMUM DEGREE, kmax, OF THE 

RESULTED NETWORKS. 



Network 


Nodes 


Links 


T 


r 


'7? 

f^avg 




(a) Film actor 


82,593 


3,666,738 


3,789 


0.206 


0.836 


0.813 


Remove 1 best-connected node 


82,582 


3,662,949 


3,701 


0.208 


0.837 


0.814 


Remove 5 best-connected nodes 


82,578 


3,648,821 


3,229 


0.213 


0.839 


0.825 


(b) Scientist 


12,722 


39,967 


97 


0.161 


0.680 


0.647 


Remove 1 best-connected node 


12,721 


39,870 


89 


0.154 


0.674 


0.633 


Remove 5 best-connected nodes 


12,717 


39,547 


72 


0.157 


0.673 


0.623 


(c) Secure email 


10,680 


24,316 


205 


0.238 


0.653 


0.680 


Remove 1 best-connected node 


10,675 


24,111 


162 


0.267 


0.651 


0.591 


Remove 5 best-connected nodes 


10,616 


23,603 


101 


0.354 


0.703 


0.631 


(d) Email 


1,133 


5,451 


71 


0.078 


0.242 


0.247 


Remove 1 best-connected node 


1,131 


5,380 


51 


0.072 


0.211 


0.161 


Remove 5 best-coimected nodes 


1,127 


5,180 


49 


0.058 


0.195 


0.128 


(e) Musician 


198 


2,742 


100 


0.020 


0.543 


0.307 




197 


2,642 


95 


0.057 


0.583 


0.615 


Remove 5 best-connected nodes 


193 


2,345 


55 


0.056 


0.442 


0.357 


(f) Metabolism 


453 


2,025 


237 


-0.226 


0.265 


0.263 


Remove 1 best-connected node 


452 


1,788 


122 


-0.222 


0.195 


0.304 


Remove 5 best-connected nodes 


448 


1,385 


72 


-0.153 


0.352 


0.338 


(g) Protein 


4,626 


14,801 


282 


-0.137 


-0.046 


0.033 


Remove 1 best-connected node 


4,574 


14,519 


197 


-0.140 


-0.026 


0.080 


Remove 5 best-connected nodes 


4,497 


13,937 


115 


-0.128 


0.012 


0.074 


(h) Power grid 


4,941 


6,594 


19 


0.004 


0.205 


0.258 


Remove 1 best-connected node 


4,939 


6,575 


18 


0.005 


0.189 


0.229 


Remove 5 best-connected nodes 


4,917 


6,515 


13 


0.008 


0.172 


0.217 


(i) Internet 


11,174 


23,409 


2,389 


-0.195 


-0.097 


0.036 


Remove 1 best-connected node 


10,782 


21,020 


1,333 


-0.214 


-0.049 


-0.039 


Remove 5 best-connected nodes 


9,368 


17,155 


560 


-0.192 


0.004 


0.047 
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Fig. 4. Second-order mixing coefficients vs average clustering coefficient 



i.e. networks with identical degree distribution could have hugely different mixing patterns both in the 
first-order Q, ifTTI and in the second-order. It highlights the limitation of characterising network topology 
by degree distribution alone and the critical importance of appreciating other topological properties as 
well. 

E. Clustering Coefficient 

We also examined whether the second-order assortative mixing is a consequence of the clustering 
behaviour observed in many social networks, where one's friends are also friends of each other. This 
is quantified by the clustering coefficient, Cj, which is defined as Cj = ^^^'2i)/2 ' where ki is the 
degree or the number of neighbours of node i, the denominator ki{ki — l)/2 is the maximum possible 
connections between the neighbours, and Cj is the actual number of connections between them |2|. The 
average clustering coefficient, (C), is the arithmetic average over all nodes in the network. Comparison 
of (C) with either TZavg or TZmax in Table|I] and Fig.|4] shows that high values of the second-order 
assortative coefficients occur for both high and low values of clustering coefficient. For example, while 
the Scientist network has high clustering coefficient and high second-order assortative coefficients, 
the Secure Email network has low clustering coefficient and high second-order assortative coefficient. 
On the other hand, the Internet has an average clustering coefficient higher than the two email networks, 
but the Internet's second-order assortative coefficients are the lowest. 
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Fig. 5. Average clustering coefficient of fc-degree nodes. 



Fig|5] reveals that the Scientist network and the Secure email network are fundamentally 
different not only in the average clustering coefficient but also in the relation between clustering coefficient 
and node degree. Yet they have similar high values of IZavg and TZmax- In contrast, the Scientist 
network and the metabolism exhibit similar values for the average clustering coefficient, but their 
TZavg and TZmax valucs are significantly different. This suggests that the second-order assortative mixing 
is something quite unexpected, particularly considering the work on the hierarchical organisation of 
complex networks |[23l . |[24ll . 

F. Triangles 

It is interesting to consider how many of the two neighbours of maximum degree are one and the 
same, i.e. the most prominent contact at each end of a link is the same person, and therefore forms a 
triangle. Let X denote the degree difference between the most prominent neighbour of the two end nodes 
of a link, i.e. X = \Kmax — -f^maa l' and L^x denote the number of links with X < x. TableHVl shows 
the ratio of L<4, L<2 and L<i to the total number of links, L, respectively. Note that L<i represents 
the case where Kmax = -f^max- ^^^^ shown is L^/L, where La is the number of links for which the 
most prominent neighbour of the two end nodes are one and the same node (i.e. La G L<i). 

For the case of the Film actor network, the Unk ratio values are almost the same, 34%, indicating 
that the second-order mixing behaviour in this network is largely due to the triangle case where the most 
prominent neighbour of a link's two end nodes are one and the same node. The same is true for the 
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TABLE IV 

Link ratio values of the networks under study. 



Network 










Rfnax 


(a) Film actor 


34.2% 


34.2% 


34.1% 


34.1% 


0.813 


(b) Scientist 


50.2% 


45.4% 


42.9% 


41.6% 


0.647 


(c) Secure email 


43.2% 


37.8% 


35.5% 


34.6% 


0.680 


(d) Email 


26.8% 


20.2% 


15.0% 


12.8% 


0.247 


(e) Musician 


56.3% 


56.0% 


55.7% 


55.7% 


0.307 


(f) Metabolism 


51.3% 


51.1% 


50.8% 


50.7% 


0.263 


(g) Protein 


10.5% 


8.0% 


6.4% 


5.7% 


0.033 


(h) Power grid 


68.1% 


38.3% 


19.1% 


7.8% 


0.258 


(i) Internet 


20.5% 


20.3% 


20.2% 


20.2% 


0.036 



Musician network. However, for other social networks, e.g. the Scientist and Secure email 
networks, we observe that while a significant portion of links share a common most prominent neighbour, 
there is a further 1% of links (L^i/L — L/^/L) of which the most prominent neighbours are not the same 
but have the same degree value, more than 2% of links whose neighbours maximum degrees differ by 
one, and 7% of links whose neighbours maximum degrees differ between one and three. TableHVl also 
shows there is no correlation between the link ratio L/^/L and the coefficient Rmax- Clearly neither the 
clustering coefficient nor triangles provide an adequate explanation for our observations. 

G. Bipartite Network 

A bipartite network is a network with two non-overlapping sets of nodes A and T, where all links 
must have one end node belonging to each set. For example, actors star in films, scientist write papers, 
and musician play in bands. The Film actor. Scientist and Musician networks under study are 
constructed from bipartite networks, e.g. two actors are linked if they co-star in a film and two scientists 
are linked if they co-author a paper. 

The Film actor. Scientist and Musician networks all exhibit strong second-order assortative 
mixing (see Table IB. It is therefore reasonable to ask whether the second-order assortative mixing can be 
attributed to the nature of bipartite networks? For example, all actors of one movie constitute a complete 
subgraph, in which everyone connects with the highest-degree node in the group. 

However, we found no support for this hypothesis. Firstly, the Metabolic network is also constructed 
from a bipartite network where the two types of nodes are metabolites and reactions. Two metabolites 
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Fig. 6. Second-order assortative coefficients Rmax and Ravg vs first-order assortative coefficient r. 



are linked if they participate in a reaction. The Metabolic network, however, does not show a strong 
second-order assortative mixing. 

Secondly, the Secure email network is a non-bipartite network, where two email users are linked 
by direct email communications. The Secure email network exhibits one of the strongest second- 
order assortative mixing, whose second-order assortative coefficients are significantly higher than the 
Musician and Metabolic networks. 

Bipartite networks might be a contributing factor, but they are not an adequate explanation for the 
second-order assortative mixing. 

H. Relation Between First and Second-Order Mixing Coefficients 

Fig. [6] shows that the assortative coefficient r is loosely correlated with the second-order coefficients 
Rmax and Ravg- Howcvcr, we notice that the first-order and the second-order assortative properties are 
not trivially related. In most cases the second-order mixing is much stronger than the first-order mixing. 
Moreover there are exceptions where the two are not correlated. Consider the Metabolic network and 
the Email network, the former is strongly disassortative with r = —0.226, whereas the later is assortative 
with r = 0.078 (see Table 1). Yet boith networks exhibit similar positive values for the second order 
mixing coefficient. 
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IV. Conclusion 

For social networks, the degree of a node, i.e. the number of people a person is linked to, is often 
considered a proxy for the prominence or importance of a person. First order assortative mixing has then 
been interpreted as indicating that if two people interact in a social network then they are likely to have 
similar prominence. However, the much stronger second order assortative mixing suggests that if two 
people interact in a social network then the importance of the most prominent person each knows is very 
Ukely to be the same. 

Our experimental results demonstrated very strong second order assortative mixing in social networks, 
but weaker, or even negative values for biological and technological networks. We examined a larger 
variety of other network properties in an effort to establish whether second-order assortative mixing was 
induced from other network properties such as its power law distribution, cluster coefficient, triangles 
and bipartite graphs. However, none of these properties was found to provide an adequate explanation. 
We therefore conclude that second-order assortative mixing is a fundamental property of social networks. 

Whether our most prominent contacts serve to introduce us (since much of the time they are the same 
person) or we simply prefer to mix with people who know similarly important people, remains an open 
question. Our work reveals a new dimension to the hierarchical structure present in social networks. We 
expect that our work will provide new clues for studying the structure and evolution of social networks 
as well as complex networks in general. 
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